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wo: 

^ , We consider the problem of static assortment optimization, where the goal is to find the assortment of size at 

most C that maximizes revenues. This is a fundamental decision problem in the area of Operations Manage- 
ment. It has been shown that this problem is provably hard for most of the important families of parametric 
£ — ' of choice models, except the multinomial logit (MNL) model. In addition, most of the approximation schemes 

proposed in the literature are tailored to a specific parametric structure. We deviate from this and propose 
a general algorithm to find the optimal assortment assuming access to only a subroutine that gives revenue 
predictions; this means that the algorithm can be applied with any choice model. We prove that when the 
' underlying choice model is the MNL model, our algorithm can find the optimal assortment efficiently. 



1. Introduction 

This paper deals with the application of choice models to make decisions. There are several important 
practical applications where the end- goal is to make a decision, and a choice model is a critical 
component to making that decision. The main application area of our focus is the set of decision 
■ problems faced by operations managers. In this context, a central decision problem is the static 

assortment optimization problem in which the goal is to find the optimal assortment: the assortment of 
products with the maximum revenue subject to a constraint on the size of the assortment. Solving the 
decision problem requires two components: (a) a subroutine that uses historical sales transaction data 
to predict the expected revenues from offering each assortment of products, and (b) an optimization 
algorithm that uses the subroutine to find the optimal assortment. This paper deals with desigining 
^ ' an efficient optimization algorithm. 

As one can imagine, the problems of predicting revenues and finding the optimal assortment are 
important in their own right, and their consideration is motivated by the fact that any improvements 
to existing solutions will have significant practical implications. Specifically, solutions to these two 
problems lead to a solution to the single-leg, multiple fare-class yield management problem; this 
problem is central to the area Revenue Management (RM) and deals with the allocation of aircraft 
seat capacity to multiple fare classes when customers exhibit choice behavior. In particular, consider 
an airline selling tickets to a single- leg aircraft. Assume that the airline has already decided the 
fare classes and is trying to dynamically decide which fare-classes to open as a function of the 
remaining booking time and the remaining number of seats. This dynamic decision problem can be 
ca st in a reasonably straightforw ard manner as a dynamic program with one state variable. As shown 
in iTalluri and van Rvzinl (|2004l ) , the solution to the dynamic program reduces to solving a slight 
variant of the static assortment optimization problem. Thus, solution to the two problems effectively 
solves the single-leg, multiple fare-class yield management problem aL" a central problem to RM 
with huge practical implications. 
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Given the subroutine to predict revenues, we need an efficient algorithm to search for the optimal 
assortment. In particular, we are interested in solving 

argmax R(A4), 

\M\<c 

where R(A4) is the expected revenue from offering assortment A4. In this chapter, we assume access 
to a subroutine that can efficiently generate revenue predictions for each assortment A4 , and our goal 
is to design an optimization algorithm that minimizes the number of calls to the subroutine. The 
revenue predictions can themselves be generated either using a specific parametric choice model or 
using the nonparametric approach described in the previous chapter. Assuming there are TV products 
and a constraint of C on the size of the optimal assortment, exhaustive search would require 0(N ) 
calls to the revenue subroutine. Such an exhaustive search is prohibitive in practice whenever N or 
C is large. Therefore, our goal is to propose an algorithm that can produce a "good" approximation 
to the optimal assortment with only a "few" calls to the revenue subroutine. Existing approaches 
focus on exploiting specific parametric structures o f choice models to solve the decision problem 



efficiently. In this context, iRusmevichientong et al.l (|2010al ) have proposed an efficient algorithm 



to find the optimal assortment in 0{NC) operations whenever the underlying model is the MNL 
model. Unfortunately, beyond the simple case of the MNL model, the optimization problem or its 
vari ants are provably hard ( like the NL and MMNL models; see IRusmevichientong et al. I (l2009t ) 



and Rusmevichientong et al. ( 2010bl )). In addition, the algorithms proposed in the literature (both 



exact and approximate) heavily exploit the structure of the assumed choice model; consequently, the 
existing algorithms - even without any guarantees - cannot be used with other choice models like the 
probit model or the mixture of MNL models with a continuous mixture. Given these issues, our goal 
is to design a general optimization scheme that is (a) not tailored to specific parametric structures 
and (b) requires only a subroutine that gives revenue estimates for assortments. 

Overview of our approach. We propose a general set-function optimization algorithm, which 
given a general function defined over sets, finds an estimate of the set (or assortment) where the 
function is maximized. This set-function optimization algorithm clearly applies to the static assort- 
ment optimization problem, thereby yielding the optimization scheme with the desired properties. 
Note that since we are considering a very general setup, there is not much structure to exploit. 
Hence, we adopt the greedy method - the general technique for designing heuristics for optimization 
problems. However, a naive greedy implementation algorithm fails even in the simple case of the 
MNL model. Specifically, consider the simpler un-capacitated decision problem. Here, a naive greedy 
implementation would start with the empty set and incrementally build the solution set by adding at 
each stage a product that results in the maximum increase in revenue; this process would terminate 
when addition of a product no longer results in an increase in revenue. It is easy to see that the 
naive implementation would succeed in solving the decision problem only if the optimal assortments 
exhibit a nesting property: the optimal assortment of size C± is a subset of the optimal assortment 
of size C 2 whenever Ci < C 2 . Unfortunately, the nesting property does not hold even in the case 
of the MNL model. In order to overcome this issue, we allow for greedy "exchanges" in addition to 
greedy "additions." Particularly, at every stage, we allow a new product to be either added (which we 
call an "addition") to the solution set or replace an existing product (which we call an "exchange") 
in the solution set; the operation at each stage is chosen greedily. The termination condition now 
becomes an interesting question. As in the naive implementation, we could terminate the process 
when addition or exchange no longer results in an increase in revenue. However, since we never run 
out of products for exchanges, the algorithm may take an exponential (in the number of products) 
number of steps to terminate. We overcome this issue by introducing a control parameter that caps 
the number of times a product may be involved in exchanges. Calling that parameter b, we show that 
the algorithms calls the revenue subroutine 0(N 2 bC 2 ) times for the capacitated problem. We thus 
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obtain a general algorithm with the desired properties to solve the static assortment optimization 
problem. 

Guarantees for our algorithm. We derive guarantees to establish the usefulness of our opti- 
mization procedure. For that, we first consider the case of the MNL model, where the decision 
problem is well-understood. Specifically, we assume that the underlying choice model is an instance 
of the MNL family and the revenue subroutine yields revenue estimates for assortments under the 
specific instance. We can show that the the algorithm we propose, when run with b>C, succeeds 
in finding the optimal assortment with 0(N 2 C 3 ) calls to the revenue subroutine. Therefore, in the 
special case when the underlying choice model is the MNL model, our algorithm captures what is 
already known. It also provides a simpler alternative to the more complicated algorithm proposed by 



Rusmevichientong et al.l (|2010af ) . We also consider the case when noise corrupts the available revenue 



estimates - a common practical issue. In this case, we show that our algorithm is robust to errors in 
the revenue estimates produced by the subroutine. Particularly, if the underlying choice model is the 
MNL model and the revenue estimate produced by the subroutine may not be exact but within a 
factor 1 — e of the true value, then we can show that our algorithm finds an estimate of the optimal 
assortment with revenue that is within 1 — /(e) of the optimal value; here /(e) goes to zero with 
e and also depends on C and the parameters of the underlying model. In summary, our theoretical 
analysis shows that our algorithm finds the exact optimal solution in the noiseless case or a solution 
with provable guarantees in the noisy case, whenever the underlying choice model is the MNL model. 
In this sense, our results subsume what is already known in the context of the MNL model. 

In the context of the more complicated models like the nested logit (NL) and the mixtures of MNL 
models, the decision problem is provably hard. As discussed above, even obtaining a PTAS can be 
very complicated and requires careful exploitation of the structure. We however believe that it is 
possible to obtain "good" approximations to the optimal assortments in practice. 

Organization. Next, we describe in detail the optimization algorithm we propose and the guar- 
antees we can provide. The rest of the chapter is organized as follows. The optimization algorithm, 
which we call GreedyOPT is described in Section [2J We then describe the precise guarantees we 
can provide on the algorithm in Section [3) Finally, we present the proofs of our results in Section 0] 
before concluding in Section [5l 



2. Description of GreedyOPT 

We now provide the detailed description of our optimization algorithm GreedyOPT. As noted 
above, most of the algorithms proposed in the literature - both exact and approximate - are based on 
heavily exploiting the structure of the assumed choice model. Unfortunately, since we are considering 
a very general setup, there is not much structure to exploit. Hence, we adopt the greedy method - 
the general technique for designing heuristics for optimization problems. 

A naive greedy implementation however fails even in the simple case of the MNL model. Specifically, 
consider the simpler un-capacitated decision problem. Here, a naive greedy implementation would 
start with the empty set and incrementally build the solution set by adding at each stage a product 
that results in the maximum increase in revenue; this process would terminate when addition of a 
product no longer results in an increase in revenue. It is easy to see that the naive implementation 
would succeed in solving the decision problem only if the optimal assortments exhibit a nesting 
property: the optimal assortment of size C\ is a subset of the optimal assortment of size C 2 whenever 
C\ <C 2 - Unfortunately, the nesting property does not hold even in the case of the MNL model. 

In order to overcome this issues associated with the naive greedy implementation, we allow for 
greedy "exchanges" in addition to greedy "additions." Particularly, at every stage, we allow a new 
product to be either added (which we call an "addition") to the solution set or replace an existing 
product (which we call an "exchange") in the solution set; the operation at each stage is chosen greed- 
ily. The termination condition now becomes an interesting question. As in the naive implementation, 
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we could terminate the process when addition or exchange no longer results in an increase in revenue. 
However, since we never run of products for exchanges, the algorithm may take an exponential (in 
the number of products) number of steps to terminate. We overcome this issue by introducing a 
control parameter that caps the number of times a product may be involved in exchanges. Calling 
that parameter b, we show that the algorithms calls the revenue subroutine 0(N 2 bC 2 ) times for the 
capacitated problem. We thus obtain a general algorithm with the desired properties to solve the 
static assortment optimization problem. 

The formal description of the algorithm is provided in Figures [2] and [JJ For convenience, whenever 
an exchange takes place, we call the product that is removed as the product that is exchanged-out and 
the product that is introduced as the product that is exchanged-in. Now, the algorithm takes as inputs 
the capacity C, the initial assortment size S, and a bound b on the number of exchange-outs. The 
algorithm incrementally builds the solution assortment. Specifically, it searches over all assortments 
of size S. For each such assortment, the algorithm calls the subroutine GreedyADD-EXCHANGE 
(formally described in Figure [2J at most C — S times to construct an assortment of size at most C. 
Of all such constructed assortments, the algorithm returns the one with the maximum revenue. 

Figure 1 GreedyOPT 

Input: Initial size S, capacity constraint C such that 1 < S < C < N, and revenue function R(-). 
Output: Estimate of optimal assortment M OPT of size |M OPT | < C 

Algorithm: 

Initialization: M OPT <s— 

for each M C M such that \M\ = S 

for S + l<i<C 

M ^GreedyADD-EXCHANGE(A4, Af, b, R{-)) 

end for 

if R(M OPT ) < R(M) 

MOPT ^ M 

end if 
end for 

Output: M OPT 



Running-time complexity: It is easy to see that the number of times GreedyOPT calls the 
revenue function R(-) is equal to (C — S) (g) times the number of times GreedyADD-EXCHANGE 
calls the revenue function. In order to count the number of times GreedyADD-EXCHANGE 
calls the revenue function R(-), we first count the number of times the while loop in GreedyADD- 
EXCHANGE is executed. The number of times the while loop runs is bounded above by the 
maximum number of iterations before the set M becomes empty In each iteration either an addition 
or an exchange takes place. Since there is at most one addition that can take place and \J\f\ decreases 
by 1 whenever exchange-outs(i) of a product i reaches b, it follows that the while loop runs for at most 
Nb + 1 iterations. In each iteration of the while loop, the revenue function is called at most 0(CN) 
times. Thus, GreedyADD-EXCHANGE calls the revenue function at most 0(CbN 2 ) times. Since 
(g) = 0(N S ), we can now conclude that GreedyOPT calls the revenue function 0(C 2 bN s+2 ). The 
choice of S will depend on the accuracy of revenue estimates we have access to. Next, we provide 
guarantees on GreedyOPT, which provide guidance on the choice of S. 
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Figure 2 GreedyADD-EXCHANGE 



Input: assortment A4, product universe Af, revenue function R(-), maximum number of exhange- 
outs b 

Output: Estimate of optimal assortment of size at most \A4\ + 1 
Algorithm: 

Initialization: A4 <— A4 , AT Af, exchange-outs(i) = for each i £ Af 
while AV0 

/ /try exchanging products 

= argmax^ ^i? ((A4 \ {i}) U {j}) 

M exchange 

1 1 try adding a product 

k* = arg max fcgJ ^ R(A4 U { k}) 

M add ^MU{k*} 

if \M\ < \M\ + 1 and #(A( add > and i?(A( add ) > i?(A( cxchangc ) 

/ / add the product k* 

M <~ A^add 

M^A7\{k*} 
else if i?(Af oxch angc) > R(M) 
1 1 exchange products i* and j* 

AA. i AA. exchange 

exchange-outs(i*) exchange-outs(i*) + 1 
if exchange-outs(i) > b 
Af^N\{f} 

M^(M\{r})u{n 

else 

break from while 
end if 
end while 

Output: Ai 



3. Theoretical guarantees for GreedyOPT 

We now give a precise description of the main results we can establish for the GreedyOPT algo- 
rithm. Specifically, suppose that the underlying choice model is an MNL model with weights w = 1 
for product and Wi for product i E Af; recall that the choice probabilities are given by 

in ■ 

¥{i\M) 



1 + E 3 eM w j 

Note that 1 appears in the denominator because of the no-purchase option. In particular, the proba- 
bility that an arriving customer leaves without purchasing anything when assortment Ai is on offer 
is given by 

F{0\M) = - — =^ . 

1 + 



() 
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Let R{M) denote the expected revenue from assortment A4. Under the MNL model, we have 

E PWi 
R(M) '-- M 



1+ E Wi 

i£M 



where pi is the price or the revenue obtained from the sale of product i. 

We now have the following theorem when the revenue subroutine provides exact revenues: 

Theorem 1. Suppose the underlying model is the MNL model with weights W\, w 2 , ■ ■ ■ , and the 
revenue subroutine provides exact revenues. Then, for any S > and b>C + \, the GreedyOPT 
algorithm finds the optimal solution to CAPACITATED OPT problem. 

Therefore, taking S = and b = C + 1, GreedyOPT finds the optimal assortment of size at most 
C by calling the revenue function 0(N 2 C 3 ). Thus, our algorithm provides a simpler alternative to 



the more complicated algorithm proposed by iRusmevichientong et al.l (|2010al ) 



We next show that the GreedyOPT algorithm is robust to errors in the available revenue esti- 
mates. Specifically, we consider the more realistic setting where one has access to only approximate 
estimates of revenues i.e., we assume access to a function R(-) such that for any assortment M we 
have 

(1 - e(M))R(M) < R(M) < R(M) 

for some parameter < e(M) < 1. Naturally, the parameter e{Ai) determines the quality of revenue 
estimates we have available. Assuming that we have access to only approximate revenues, we find 
the optimal assortment by running GreedyOPT with approximate revenues. In order to describe 
the result, we need some notation. For any assortment A4, let w(A4) denote 1 + J2ieM w i- Further, 
let 

e max = max e(M) and Wr ax = max w(M). 

M: \M\<C M: \M\<C 

Finally, we defer to the next section the precise definitions of two quantities C(8 C ) and 5 C that we 
need to describe the theorem; it suffices to say that as e max — > 0, we have 8c — > and C(8c) —> C. 
With these definitions, we can now state our result. 

Theorem 2. Let Afp PT denote the optimal assortment of size at most C and M^ PT denote the 
estimate of the optimal assortment produced by GreedyOPT when run with inputs S > and 
b > C(25c) + 1. Then, we must have 

R{Mg PT ) - /l ' 

where w denotes the vector of weights (u>i,W2, ■ • ■ , w N ) and 



ma,\ 



with r?(e max ) d = 4Ce max / (1 - e z 



It is easy to see that the algorithm calls the revenue function 0(N 2 C 2 C(2S C )) times. Note that as 
£max 0, v( £ max) an d hence f(w, e max ) go to zero. In addition, it follows from our definitions that 
as e max — > 0, C(28 c ) —> C. Consequently, taking the error in revenues e max = yields in Theorem [2] 
yields the result of Theorem [Q as the special result. Therefore, we only prove Theorem [2] in the next 
section. 
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4. Proofs of the main results 

In this section we prove Theorem [21 specifically, we establish that the revenues of the optimal assort- 
ment and the estimate of the optimal assortment produced by GreedyOPT are "close". In order to 
establish this result, for the rest of the section, fix a capacity C. Let M OPT and M OPT respectively 
denote the optimal assortment and the estimate of the optimal assortment produced by Greedy- 
OPT. Then, our goal is to show that R(M OPT ) and i?(M OPT ) are "close" to each other. We assume 
that the underlying choice model is the MNL model with parameters W\, W2, ■ ■ ■ , ifjv- Recall that for 
any assortment M, 

E Pi w t 

R(M) '~- M 



1 + E Wi ' 

where p.j is the price of product i. The term in the denominator makes comparison of the revenues of 
two different assortment difficult. Therefore, instead of dealing with the revenues of the assortment 
directly, we consider the following transformation of the revenues of assortments: for any assortment 
M and number u6R, 



R{M) 



E PiWi E (Pi - u ) w ' 

ieM ^ _ \ieM 

1+ E w 4 1+ E Wi 

ieM ieM 
H M (u)-u 



w(M) ' 

where H M : R — > K is a function defined as H M (u) = Eiewi (Pi ~ u ) w i andw(M) = 1 + E %eM w i- We 
can now write 

H M (u) = u + w(M)(R(M)-u). (1) 

It is clear that H M {-) is directly related to the revenue R(M). Moreover, as will become apparent 
soon, it is easier to compare the transformations H Ml {-) and H M2 (-) of two assortments Mi and 
M2 than their revenues R(Mi) and R(M2)- Specifically, we can establish the properties stated in 
the following proposition. 

PROPOSITION 1. For any two assortments Mi and M2, let H±(-) and H 2 (-) respectively denote 
the functions H Ml (-) and H M2 {-). Further, let Ui and u 2 denote the revenues R(Mi) and R^M.^) 
respectively. We then have 

1. Hi(u 2 ) > H 2 (u 2 ) R{Mi)>R(M 2 ). 

2. Hi(u 2 ) > (1 + 5{Mi))H 2 (u 2 ) => R{Mi)>R{M 2 ), 
where 5{M X ) = e{M x )w{Mi) / {I -s(M x )). 

W e prove each of the properties in turn. First note that for any assortment M with revenue 
R(M) = u, it immediately follows from our definitions that H M (u) = u + w(M)(R(M) — u) = u. The 
first property now follows from a straightforward expansion of the terms involved: 

Hi(u 2 ) > H 2 (u 2 ) <^=^> u 2 + w(Mi)(ui - u 2 ) > u 2 

^> Ui > u 2 

^ R(Mi)>R(M2) 1 

where the second equivalence follows from the fact that w{Mi) > 0. The second property can also 
be obtained through a similar straightforward expansion of the terms. In particular, 

Hi(u 2 ) > {l + e(Mi))H 2 {u 2 ) ^u 2 + w{Mi)(ui - u 2 ) > (1 + S(Mi))u 2 



s 
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(l-e(M 1 ))u 1 >u 2 , (2) 

where the second equivalence follows from the definition of 5 (Mi). Moreover, it follows from our 
definitions that R(Mi) > (1 — e(Mi))u 1 and u 2 > R(M 2 ). We now conclude from $2) that 

R(Mi) > (l-e(M 1 ))u 1 >u 2 >R(M 2 ). 

The result of the proposition now follows. 

The above proposition establishes that if the transformation H M (-) of one assortment is "suffi- 
ciently" larger than the other, then it follows that the revenues of one assortment should be larger 
than the revenues of the other. Therefore, instead of keeping track of the revenues of the assortments 
in our algorithm, we keep track of their respective transformations H M (-). 

Next, we establish a loop-invariance property that arises due to greedy additions and exchanges in 
our algorithms. We make use of this property to prove our theorems. In order to state the proposition, 
we introduce the following notation: 

t dd f( , , , e(M) 

oc = max o(M) = max w(M)- 



M:\M\ M: \M\ l—e(M) 

We then have 

PROPOSITION 2. Consider an iteration t of the while loop of the GreedyADD-EXCHANGE 
algorithm. Let M t and M t +i denote the estimates of the optimal assortments at the beginning and 
the end of iteration t. Let Aft denote the universe of products at the beginning of iteration t. Then, 

1. if a greedy exchange takes place i.e., AA t +i — (M t \ {i*}) U {j*}, then for u = R(M t +i), we must 
have 

hi*(u) < hi(u) + Sou, for all i G A4 t 

hj* (u) > hj (u) — 5 c u, for all j G Af t \ A4 t ; 

2. if an addition takes place i.e., A4 t +i = A4 t U {j*}, then for u = R(Ad t +i) we must have 

hj,(u) > hj(u) - 5 c u, for all j G Af t \ M t . 

W e prove this proposition by contradiction. First consider the case when exchange happens i.e., 
M t+1 = (M t \ {i*}) U {j*}. Note that for any assortment M = (M t \ {i}) U {j} with i G M t and 
j EAft \ M t , letting u denote R(M t +i), we can write 

H M (u) - H Mt+1 (u) = hj (u) - hj* (u) + hi* (u) - h t (u) . (3) 

Now, if the hypothesis of the proposition pertaining to exchange is false, then at least one of the 
following should be true: either (1) there exists a product i € M t and i ^ i* such that hi* (u) > hi(u) + 
ScU, or (2) there exists a product j G Aft \ M t and j ^ j* such that hj* (u) < hj (u) + 5cU. In the first 
case when hi*(u) > hi(u) + 5 c u, by taking j = j* , we can write from ([3]) that H M (u) — H M (u) > 
5cu. Similarly, in the second case when hj* (u) < hj(u) + 5cu, by taking i = i* , we can write from ([3]) 
that H M (u) — H Mt+1 (u) > 8 c u. Therefore, in both the cases, we have exhibited an assortment M 
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distinct from M t +i that can be obtained from Ai t through an exchange and has the property that 
Hm( u ) ~ H Mt+1 { u ) > &c u - We can now write 

H M (u) > H Mt+1 (u) + S c u (4a) 

=> H M (u) > H Mt+1 (u) + 5{M)u since 5 C > S(M) by definition (4b) 

=> H M (u) > (1 + 5{M))H Mt+l (u) since H Mt+1 (u) = uby definition (4c) 

=> R{M)>R{M t+ i) by Proposition d (4d) 

This clearly contradicts the fact that A4 t +i is chosen greedily. 

The case when addition happens can be proved in the exact similar way. Particularly suppose 
there exists a product j G M t \Mt and j ^ j* such that h 3 *(u) < h 3 (u) — Scu, where u = R(A4 t +i) 
with M t +i = -Mt U {j}. Letting M denote the set A4 t U {j}, we can then write 

H M (u) ~ H Mt+1 (u) = hj(u) - hj* (u) > 8 c u. 

This implies - following the sequence of arguments in @ - that R(A4) > R(A4 t ), contradicting the 
fact that M.t+i is chosen greedily. 

The result of the proposition now follows. 

The above proposition establishes a key loop-invariance property that results from greedy additions 
and exchanges. Specifically, let u denote the revenue of the estimate of the optimal assortment 
obtained at the end of an iteration of the while loop in GreedyADD-EXCHANGE. Then, the 
proposition establishes that whenever a product j* is introduced (either through addition or an 
exchange-in) greedily, it must be that hj*(u) is "close" to the maximum hj(u) of all products j 
that have been considered for an addition or exchange-in. Similarly, the product i* that is greedily 
exchanged-out must be such that hi*(u) is "close" to the minimum h^u) of all products i that have 
been considered for an exchange-out. 

Using the propositions above, we can establish a key property of the subroutine GreedyADD- 
EXCHANGE. For that, we need the following notation. For any u, define 



Bs(u) d = arg max H M (u) = &rg max hAu). 

M-.\M\<S M: \M\<S f-f. 

'- i€M 



It is easy to see from the above definition that B s (u) consists of the top at most C products according 
to hi(u) such that hi(u) > 0. Since /i,(-) is monotonically decreasing, it is easy to see that 

\Bs{ u i)\ > \Bs(u 2 )\, whenever U\ < u 2 - (5) 



Under appropriate technical assumptions. iRusmevichientong et al. ( 2010a ) showed that for any 1 



< 

S < N, the optimal assortment of size at most S under the MNL model is one of the assortments in 
the collection Bs = {Bs(u) : u G M}. In fact the authors show that if us denotes the optimal revenue, 
then B s (us) is the optimal assortment. An immediate consequence of this result and ([5]) is that for 
any u<u s 

S>\B s (u)\>\M° PT \. (6) 



It has been established by IRusmevichientong et al.1 (l2010al ) that there can be at most 0{NC) distinct 



assortments in the collection Bs allowing one to find the optimal assortment by restricting one's search 
to O(NC) assortments. The following lemma shows that the assortment found by the subroutine 
GreedyADD-EXCHANGE is "close" to one of the assortments in Bs- Before we describe the 
lemma, we need the following notation. For any 5 > and u£i, let 

is(u)^= min hAu). 

i£B s {u) 



10 



Author: Assortment Optimization Under a Nonparametric Choice Model 
Management Science 00(0), pp. 000-000, ©0000 INFORMS 



Moreover, let 

B s (6,u) = B s {u) U {j eJ\f\B s (u): h is(u) (u) - hj(u) <8u}, 

Also, let 

C(5) = max.\B s (S,u)L 

We then have 

Lemma 1. Suppose GreedyADD-EXCHANGE is run with some input assortment M andb> 
C{5q) + 2, where C > 8 + 1. Further, suppose that \MgYy\ =5 + 1. Then, there exists an iteration 
t* of the while loop such that if M* denotes the assortment M t * +1 and u* denotes R(M*), then 

H B{uf) (u*)-H M *(u*)< 2C U * 5 c u* , 

where B{u*) denotes the assortment Bs+i(u*) and C* is a constant denoting 1 + \B{u*) \ 

We defer the proof of Lemma [T] to the end of the section. We now present the proof of Theorem [21 

4.1. Proof of Theorem [2] 

Let M° PT denote the true optimal assortment, and M° PT denote the estimate of the optimal assort- 
ment produced by GreedyOPT. Furthermore, let C* <C denote the size of M° PT . It follows from 
Lemma [T] that in the C*th invocation of the subroutine GreedyADD-EXCHANGE, there exists 
an assortment M* such that R(M° FT ) > R(M*) and M* is such that 

H B{un (O - H M * (O < 2C U * 5 c u* , 

where C* denotes \B{u*) \M*\ + \ and B{u*) denotes the set B c *{u*). It follows by the definition 
of B{u*) that Hb( u *){u*) > H m o?t(u*). Thus, we can write 

H m ovt{u*)-H m *{u*) < 2C u *5 c u* < 2C6 c u*. (7) 

Let u c denote i?(M° PT ). Then, it follows by definition that H m opt(u c ) = u C - Thus, 

H m o P t(uc)-H m o P t(u*)= w j (u*- Uc ) = (u*-u c )(w(M2 pt )-1). 

Since H m opt(u c ) = u c , we can write 

H m o P t(u*) = u c + {u c - u*)(w(MS PT ) - 1). (8) 

Since H M * (u*) = u* , it now follows from ([7]) and ([S]) that 

[u c - u*)(w{M° PT ) - 1) + u c - u* < 2C5 c u* 
=4> (u c - u*)w(M° PT ) < 2C5 c u* 

=► u c <(l + e)u% (9) 

where e = 2C5 c /w(Mg PT ). Now since i?(M° PT ) > R(M*), it follows that 

(1 - e(M*))u* < R(M*) < R(M OPT ) < u c , 
where u c denotes R(M° PT ). It now follows from © that 

^<(l + £ >*< 
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Now since 

d c = max 7T7\ W \M ), 

M-. \m\<g 1 —e(M) 

by letting e max = max^. \m\<c^{-M) and M / ( max = max^. \m\<cw(M), we have 

Sn < - W, 



Thus, 



:<5 C < , , r OPT , i ^c ax = f(w,e mKX )/2. 



w(M8 PT ) °-w(MS PT )l-e m ^ c 
With these definitions, it is easy to see that e(Ai*) < e max < /(w, e max )/2. It now follows that 

.o^ i _ i_£^ < £±£t^£) < E( ^.) + , < /(m , £m „). 

He 1 + £ 1 + £ 

This establishes the result of the theorem. 

4.2. Proof of Lemma Q] 

Suppose the while loop in the subroutine terminates at the end of iteration T. Then, it follows from 
the description of the subroutine that at least one of the following conditions holds at the end of 
iteration T: 

1. The set of products Nt+i \ M-t+x available for additions or exchanges is empty. 

2. No further additions or exchanges can increase the revenues. 
Our goal is to prove the existence of an iteration t* <T such that 

H B{u *){u) - H M »(u) < 2C u *8 c u*, 

where A4* denotes the assortment M t * +1 and u* denotes R(M*). We prove this by considering two 
cases corresponding to each of the two ways in which the subroutine terminates. Note that in order 
to simplify the notation, we have dropped the subscript from the notation of B s +i(-). 
Case 1: Subroutine terminates with 7V T+ i =A / 1t+i- We first consider the case when the sub- 
routine terminates when the set of products A/r+i \Mt+i becomes empty In this case, we prove the 
existence of an iteration t* <T that satisfies the condition stated in the hypothesis of the lemma. 
In fact, we prove something stronger; we shall show that the iteration t* < T* , where T* < T is the 
first iteration such that At* C W (recall that .A/i =7V). We prove this result by contradiction. In 
particular, suppose that after every iteration t<T* of the while loop, we have 

H B[u) (u) - H Mt+1 (u) > 2CJ c u, (10) 

where u denotes the revenue R{M. t +\) and C u denotes the constant 1 + \B(u) \A^t+i|- Note that a 
product i would be removed from the universe At at the end of some iteration t only if it has been 
exchanged-out b times. Since b > C(8c), it is easy to see that we arrive at a contradiction if we show 
that as long (|10|) is satisfied at the end of each iteration, each product i can be exchanged-out at 
most C(S C ) + 2 times. 

In order to bound the number of times a product can be exchanged-out, we establish a special 
property that should be satisfied whenever an exchange happens. Specifically, suppose an exchange 
happens during iteration t i.e., Mt+i = {.M-t\{}*Y) U {j*}- In addition, let u denote the revenue 
R(M t +i), and let product k* £ Ai \ M t denote the product such that h k * (n) > h k (u) for all products 
k € At \ M. t - Then, we claim that 

hj*{u)>h k *{u) — 5cu (H a ) 
hi*{u) <h k *{u) — S c u. (lib) 
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We prove this claim as follows. Since k* £ Af t \ A4 t , (jllap follows directly from Proposition [2j We 
now argue that hi* (u) < h k *{u) — 5cu. For that, we first note that 

hi*(u)-h j *(u)<25 c u. (12) 

To see why, note that since an exchange has happened, it must be that R(M t ) < R(A4 t+ i). This 
implies by Proposition Q] that H Mt (u) < (1 + 5{M. t ))H Mt+1 (u). Since 5{M.\) < 5 C and H Mt+1 (u) = u 
by definition, we can write 

H Mt (u)<(l + 6(M t ))H Mt+1 (u) =► H Mt (u)-H Mt+1 (u)<8 c u 

=>■ hi* (u) — hj* (u) < 8 c u < 25 c u. 

Now, consider 

H B (u) (u) - H Mt+1 (u) = H B[u) (u) - H Mt (u) + H Mt (u) - H Mt+1 (u) 

= h A u )~ ^N + (^(m)-V( m ))- 

j£B(u)\Mt ieM t \B(u) 

We now collect terms in the above expression as follows. Let Mi denote the set M t \ B(u). Further, 
partition the set B(u) \ M t into Mscr 2 U M 3 such that M 2 H M 3 = and \M 2 \ = \Mi\; note that 
such a partitioning is possible because = 3+ 1 (which follows from §U§ and the hypothesis 

that |Mg£ x T = 3 + 1|) and \M t \ < S + 1. Also note that A4 3 ^ if and only if \M t \ < S + 1. With 
this partitioning, we can now write 

H B (u)(u)-H Mt+1 (u)= (hj(u)-hi(u))+ ^•(«) + (/i i *(«)-V(«))- 

We now claim that at least on of the following must be true: either (1) there exists a pair of products 
i € M.\ and j £ Ai 2 such that h 3 (u) — h t (u) > 25 c u, or (2) if A4 3 ^ 0, then there exists a product 
k € .M3 such that h 3 (u) > 2S c u. Otherwise, it is easy to see from (fT2"]l that H B ^(u) — H Mt+1 (u) < 
2C u 8cU, where C u = \B(u) \ M t +i\ + 1, contradicting (fTU|) . We now consider each of the cases in 
turn. 

First suppose that hj(u) — hi(u) > 25 cu for some i G A4± and j G Ai 2 . It follows from Proposition [2] 
that (u) < hi(u) + 5cu. Thus, we can write 

hi* (u) <hi(u) + 5 c u < h 3 (u) — 2S c u + 5 c u< h k * ( u) — Squ, 

where the last inequality follows from the definition of k* and the fact that j GM 2 cA^\A^ ( . Thus, 
for this case, we have established (jllbp . 

Now suppose that A4 3 7^ and h k (u) > 25 c u for some k £ A4 3 . As noted above, in this case, we 
should have |.M t+ i| < 5 + 1. This means that an exchange has happened instead of addition, which 
in turn implies that R(M) < R(M t +i), where A4 denotes the set A4 t U {k}. Thus, by Proposition [lj 
we should have 



as H Mt+1 (u) = u,5(M) < 5 C 

since h k {u) > 28cu 
since hj* (n) < h k * (u). 



H^(u)<(l + 5(M))H Mt+1 (u) 
Hm («) ~ H Mt+1 (u) < 5(M)H Mt+1 (u) 
h k (u) + h,^ (u) — hj* (u) < Squ 
hi* (u) < hj* (u) — h k (u) + 5 c u 
hi* (u) < hj* (u) — 28 c u + 8cu 
hi* (u) < h k * (u) — 5 c u 
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We have thus established that hi* (u) < h k * (u) — 8qu for both the cases. 

We now use (jlip to bound the number of exchange-outs that can happen for each product. 
Specifically, as mentioned above, we arrive at a contradiction by showing that each product can be 
exchanged-out at most C(Sc) + 2 times. For that, for any iteration t <T* , let k t denote the product 
such that k t G M t \M t and h kt (u t+ i) > hj(u t+1 ) for all products j G M t \ Mscr t and u t+ \ = R(M t +i)- 
Now define the function 

g t u \ _ i h h( u ) ~ $cu for u t <u< u t+1 ,t< T*, 
\/i fcl (iti) -5 C U! foru = U!. 

Note that for the above definition to be meaningful, for any t <T* , we need to show that u t < u t+1 . 
This should be true because by (JXTJ) , it follows that for u = R(M t +i), we have hi*(u) < hj*(u); 
this in turn implies that H Mt {u) < H Mt+1 (u), which implies by Proposition Q] that u t = R(Ai t ) < 
R(A4 t+ i) = u t +\. It is easy to see that the function <?(•) is piecewise linear. However, note that it may 
not be continuous. 

Now fix a product i, and for this product we argue that it can be exchanged at most C(5 C ) times. 
For that let t\ be an iteration in which i is exchanged-out and t 2 be the first iteration after t\ when 
i is exchanged-in. Let U\, u 2 denote R(ftA tl+1 ) and R(A4 t , 2+ i) respectively. Furthermore, let k\ and 
k 2 respectively denote the products k tl and k t2 . It now follows from (jlip that 

hiiux) < h kl («i) - 5 c u t = g(ui) 
hi(u 2 ) > h k2 (u 2 ) - S c u 2 = g(u 2 ). 

This implies that the line hi(-) is below g(-) at Ui and above g(-) at u 2 . We now argue that hi(-) 
intersects g(-) at some U\ < u < u 2 i.e., hi(u) = g(u). If g(-) were continuous, this assertion would 
immediately follow from the intermediate value theorem. However, the way we have defined g(-), it 
may be discontinuous at some u t with t\<t< t 2 . Now the only way hi(-) and g(-) do not intersect 
is if for some t x <t<t 2 , 

g(itt) < hi{u t ) < g(u^) and hi(u) > g(u) for u t <u< u 2 . 

We argue that this cannot happen. For that consider iteration t. By definition i (ji A4 t . Since Af t = A/", 
it follows by our definition that h kt (u t+ i) > which in turn implies that g(u t+ i) > h t (u t+1 ) 

resulting in a contradiction. Thus, intersects g(-) from below at some u such that Ui < u < u 2 . 

Hence, we can correspond each exchange-out with an intersection point corresponding to /ij(-) 
intersecting g(-) from below. This implies that the total number of exchage-outs can be bounded 
above by one plus the number of times hi(-) intersects g(-) from below beyond Ui, where Ui is the 
revenue of the assortment A4 t immediately after i is added to it (either through an exchange-in or 
addition). Note that hi(-) intersects g(-) at u > Ui if and only if Wi < wu u -\ and h k ( u \(ui) > hi(ui), 
where k{u) is the product such that k(u) = k t , where u t <u< u t +i- Thus, the number of intersection 
points can be bounded above by the number of products k such that h k (ui) > hi{ui). We now argue 
that i € Bg + i(5c,Ui). If this is true, then it implies that there can be at most \Bg + x(6c,Ui)\ < C(Sc) 
intersection points, which immediately implies that there can be at most 1 + C(5c) exchange-outs. 

The only thing we are left with is to argue that i G Bg+i (5c, v>i). To see this, let M be the assortment 
obtained after i is added or exhanged-in for the first time. Then, according to our definition, we have 
that Ui = R(A4). Further, since H B ( u .)(ui) — Hj^(ui) > 0, there exists a product k G B[ui) \ M. It 
now follows by Proposition [2] that 

hiiui) > h k (ui) - S G Ui > h ls+l{u . } - 5 c u t} 

where is+\(iii) is as defined above i.e., is+i(ui) == argminjgsju.) hj(ui). It now follows by the definition 
of B s+ i(5 c ,Ui) that i G B s+1 (5 c ,Ui). 
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Case 2: Subroutine terminates because no further additions or exchanges increase rev- 
enue. We now consider the case when subroutine terminates at iteration T because no further 
additions or exchanges increase the revenue. Now there are two possibilities: either Af t = AT for all 
t < T or not. In the latter case let T* be the first iteration t when J\f t C W. It then follows from our 
arguments for the above case that there exists an iteration t* < T* that satisfies the properties of the 
lemma. Thus, we consider the case when M t =M for all t<T*. Assuming this, we prove the result 
by contradiction. In particular, suppose at the end of iteration T we have 

H B{u) {u) - H Mt+1 (u) > 2CJ c u, (13) 

Now consider 

H b{u) (u)-H Mt+1 (u)= h J (u)+ (h^u) - hi(u)) , 

keM 3 ieMi,j£M 2 

where as above, Aii denotes the assortment Aix+i \B(v) and the set B(u) \ A4t+i is partitioned into 
Ai 2 U Ai 3 such that Ai 2 n Ai 3 = and | .M 2 1 = I 1 1 ; such a partitioning is possible since | B (u) \ = 
5 + 1 (which follows from ([6]) and the hypothesis that \MgTi =5 + 11) and |A4t+i| < S + 1. It now 
follows that one of the following conditions should hold: either (1) there exists a pair of products 
i G Mi and j E Ai 2 such that h 3 (u) — h^u) > 25 c u, or (2) if Ai 3 7^ 0, then there exists a product 
k G Ai 3 such that h 3 (u) > 25 c u. Otherwise, it is easy to see that H B ^(u) — H Mt+1 (u) < 2C u Scu, 
where C u = \B(u) \A4t+i \ + 1, contradicting (fP3"|) . We consider each of the cases in turn. 

First, suppose that there exist a pair of products i £ Ai± and j € M2 such that hj(u) — hi(u) > 
25 c u. Let A4 denote the assortment (Mt+i \ {i}) U {j}- We can then write 

h m( u ) ~ B Mt+1 (u) = hj(u) - h t {u) > 28 c u. 

Since H Mt+1 (u) = u and 5 C > 5(M), it follows that by Proposition [T] that RAi > RM. T +i- This 
contradicts the assumption that the subroutine terminates with Mt+i because no further additions 
or exchanges result in an increase of revenue. 

Next, suppose Ai 3 ^ and hk{u) > 28cu for some k £ A4 3 . Now let Ai = Aix+i U {A;}; note that 
since Ai 3 ^ 0, it must be that |A4t+i| = 5". We can now write 

H M (u) - H Mt+1 (u) = h k (u) > 25 c u. 

Since H Mt+1 (u) = u and 5 C > 5(Ai), it follows that by Proposition [JJ that RAi > RAir+i- This 
contradicts the assumption that the subroutine terminates with Air+i because no further additions 
or exchanges result in an increase of revenue. This finishes the proof of this case. 
The proof of the lemma now follows. 

5. Summary and discussion 

This paper focused on using choice models to make decisions. Assuming that we have access to a 
revenue prediction subroutine, we designed an algorithm to find an approximation of the optimal 
assortment with as few calls to the revenue subroutine as possible. 

We designed a general algorithm for the optimization of set-functions to solve the static assortment 
optimization algorithms. Most existing algorithms (both exact and approximate) heavily exploit 
the structure of the assumed choice model; consequently, the existing algorithms - even without 
any guarantees - cannot be used with other choice models like the probit model or the mixture of 
MNL models with a continuous mixture. Given these issues, we designed an algorithm that is (a) 
not tailored to specific parametric structures and (b) requires only a subroutine that gives revenue 
estimates for assortments. Our algorithm is a sophisticated form of greedy algorithm, where the 
solution is constructed from a smaller assortment through greedy additions and exchanges. The 
algorithm is proved to find the optimal assortment exactly when the underlying choice model is the 
MNL model. We also showed that the algorithm is robust to errors in the revenue estimates provided 
by the revenue subroutine, as long as the underlying choice model is the MNL model. 
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